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ABSTRACT 


A canonic system C is a specification of a recursively enumerable set, 
such as a set of strings over a finite alphabet. From this description 
C, it is possible to generate a system Cc called a proof measure func- 


tion, which is an indication of the complexity of the language defined. 
For certain simple but important classes of canonic systems, algebraic 
bounds on these functions can be derived from the structure of the sys- 


tem. Another transformation on C produces a system ct which character- 
izes the recognition of strings generated by C. A relationship exists 


between the measure functions of C and cn}, thus relating the complexity 
of the recognition procedure to that of the language description. 
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Chapter 1 - Review of Canonic Systems 


Introduction 


A canonic system is a recursive definition of the members of a set. 
Instead of enumerating its elements, we specify a set G by first giving 
a finite number of basic members, then rules of the form "If x is in G, 
then f(x) is also in G." Canonic systems provide the framework for 


expressing such a procedure in a formal way. 


For example, let G = faé| k is an even integer} be a set of strings 


over the alphabet {a}. Membership in G can be allowed thus: 


1. aaeG 
2. If x e¢ G, then xaa eG 
3. No other strings are in G. 


A canonic system generating G could be written 


1. faaG 
2. xG + xaaG 


where "| aaG" is another form of the membership axiom "aa ¢ G," and 
"<premise> |- <conclusion>" is a contraction of the "If...then..." 
statement. The fact that no other strings are to be allowed in G is 
implicit in the way the canonic system is to be interpreted. 


- 


Definition 1. A canonic system C is a 5-tuple 


Cc = (8, N, P, R, C') 
where 


§ = an algebraic system (A, Byres) consisting of a set A 


and a collection of operations Wyre es ®y on A. 


N= a finite set of variable symbols, usually {x, y,...,2}. 


a] 
i} 


a finite set of predicates, each of which is assigned a 


unique positive integer called its degree. 


R = a finite set of rules of inference, called canons, whose 


form will be described in Definition 2. 


C' = a canonic system which defines the interpretation of the 


rules of inference in C; its structure and significance 


will be discussed in the next section. 


Previous formulations of canonic systems have tacitly assumed the 
algebraic system S to be the monoid (v", *) of all finite strings over 
an alphabet V under the operation of concatenation. By making the 
algebraic system explicit, we are able to specify canonic systems gen- 
erating sets of integers, real numbers, and so on, as well as sets of 


strings. 


Definition 2. A canon is a rule of inference of the form 


£ (Xp oeeesX DEL A eee A ECR sees XK DF fb BCX 200+ X GE 


where F Sais Fy and G are predicates in P; and fi» aera fi.’ and g 


j’ 
are functions on A formed from Wpree eee by composition. This canon 


would be read: 
If £ (x) 5-00 oX)) has the property Fi; and ..., and 
aC SED a) has the property Fi then B(X,s+++5%,) has 


the property G. 


If Q is interpreted as the extension of a predicate, Q = {x| Q(x)}, 


then the meaning of a canon may be defined as 


G={ul g@ Bienen e eA [£, (2, .++.52,) € Fy A see 


a ae 7 
A COs) eA B(Z,5+0+52,) uj}. 


The quantity F (xy 2-44 s%))Q is called a remark; the remarks 


f(x) 5-- ox DE, are the premises of the canon, and B(X) +++ 5% )G is 
its conclusion. A remark Ff (Kyo + +0 Ky) Q is said to be true iff there 


exist Zp reve sy é¢ A such that F(z, r 0-629) é Q. 


* 
For example, consider this canon over the algebraic system (V , °*), 


where V is a finite alphabet and * is the concatenation operation: 
xB - axaC . 


This is interpreted as, "If x has the property B, then the string axa 
has the property C." Formally, 


~é * ~ 
c={ul@zev [zeB A u = azal} 
= {axa| xe B} if Bis a non-empty set of strings over V . 


Another canon might be 

xaD A yE |} <x, ya F 
whose interpretation is 

~ * ~ ~~ ; 

F = {<u, v>| ax,ye V [xae D A ye E A <u, vw = <x, ya)}, 
where the notation <u, v> denotes the ordered pair of u and v. Infor- 
mally, we are searching D for a string that ends in "a," then forming 
an ordered pair from part of it and any string from E. 

An axiom is a canon with no premises; it is therefore universally 


true. If a function B(X)5+++5%,) appears in an axiom, then Kypreeesk, 


must be allowed to range over all elements of A. Axioms might be | 2N 


or -x’s. 


Definition 3. A remark w G is immediately derived from a sequence 
of remarks OE ores wF, W, e A, iff 
1. £ (x) oe- oe DE, A wee A £Xprer XE FE B(x) 4++-5% GE 


is a canon. 


| ZypssersZ A such that £,(z),-++22,) = Wy) and 


B(Z), 00052) = W. 


Definition 4. A proof of a remark r is a finite List of remarks 


Tiwesre ks, such that 


1. Ty is an axiom, 
2. vy; (1 < is p) is either an axiom or a remark immediately 


derivable from some subset of {ryoeeeoTy ats 
To illustrate these definitions, let us consider the system 
* 
c= (({a, bt’, °), {x}, {A}, R, C'), where the canons in R are 


f PA 
xA + axaA . 


This system contains the single axiom | bA, so it is trivially provable 
that bA. A single application of the canon xA }+ axaA yields the con- 
clusion abaA. Continuing this process, we obtain the proof 
bA, abaA, aabaaA,'...; a"ba"A . 
Thus A = {a"ba™ | n2 0}, a context-free language. 
We occasionally use the phrase "the language (or the set) generated 
by the system C" to mean the set G defined by some predicate G in C, 


Either the appropriate predicate will be obvious, or the selection of G 
will depend upon the situation in which the result is applied. 


Hierarchy of Canonic Systems 


A proof in a formal system usually consists of a finite number of 


repetitions of two operations: 


1. Substituting elements of A for variables, 


e 


2. Inferring Y from X and X ~ Y. 


These are operations which can be carried out by a mathematician with 
paper and pencil. In order to be able to formalize the idea of a 
proof, Moore and Donovan [ 3 ] allowed a canonic system C' to describe 
the concept of a "proof" in a system C. A hierarchy is thus formed, 
since a proof in C' is, in turn, described by a system C". For some k, 


(k+l) = ok) so that no new features will be found in'higher level 


systems. 


For example, consider the system 
* 
C = (({a, b} , +), {x}, {A}, R, C'), where R consists of the 


canons 
1. -bA 
2. xA faxaA. 

K = {b, aba, aabaa, ...}. The second level system C' will generate the 


proofs that each of these strings is defined by A. In other words, C' 


will generate the set 
{bA, bA;abaA, bA;abaA;aabaaaA, ...} 
where we have used the semicolon only to avoid a collision of punctuation. 


This set can be considered a set of strings over the alphabet 
{b, A, ;}, and as such it may be generated by some canonic system. 
Define C' to be that system. Its canons describe the effect on a proof 


of applying each of the rules of C: 


1. A proof 
2. q proof - q;bA proof 
3. q proof A B;yAS proof |» g;ayaAs proof 


* 
Then C' = CChss b, A} » *); {as B, Y,§ } {proof}, R, Cc). It is not 
difficult to see how C' operates. First, for convenience, it defines 


10 


the null string as a proof. Then, if some proof q has already been 
derived, & followed by the symbols ";bA" is still a proof because kbA 
is an axiom of C. Finally, if a proof q@ exists such thet the pattern 
";<some string y> A" appears within it, @ followed by the symbols 

"; a<some string y>a A" is also a proof. This corresponds to applying 
the canon xA fF axaA in C. 


This formulation of higher-level systems is considerably simpler 
than their original description [ 4 ]. Since we shall have occasion 


to use such systems later, this transparency will be beneficial. 


Summary 


This chapter has been a review of the definitions and terminology 
associated with canonic systems. We have revised some of the formulations 
so that they are applicable over arbitrary algebraic systems; this will 
allow us to write canons describing, for example, sets of integers, as 


well as sets of strings. 


Chapter 2 - Proof Measure Functions 


Introduction 


This section develops a measure of complexity for sets of strings 
specified by a canonic systems. Such a measure is related to the time 
it would take a generator operating from the canonic system to produce 


an element of the language being described. 


Definition 1. A proof of a theorem t in a system C is a finite 


oot such that 


list of remarks r k 


pee 


Ty is an axiom 
2. ry (1 < i<k) is either an axiom or a theorem 
immediately derivable from some subset of {FyoeeeoFy yd: 
3. r= t. 
Recall that, ina given canonic system C, the meaning and derivation 
of a proof is given in the next higher level system C'. Since C' is 
capable of generating all possible proofs of theorems in C, we make the 


following definition. 


Definition 2. The proof measure function m(C, t) of a theorem t 


in a canonic system C is the number of remarks in the shortest proof of 
t. That is, if ty r is the shortest proof of t in C, then 
m(C, t) = k. 


»r 


grrrety, 


For example, in a system S containing the canons 


1. bA 
2. xA } axaA 


we would have the proof bA, abaA, aabaad. Therefore m(S, aabaa) = 3 


since there is precisely one proof for each string in this system. 
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Properties of m(C, t) 


We may now prove several properties of proof measure functions. 
Theorem 1. If t is a theorem of C, then mC, t) is computable. 
Proof. If t is a theorem of C, then there exists a proof of 
t in C: Th Foreeesky such that t = Ty This proof, considered as a 
string of symbols, is a string derived by the next higher level system C'. 


Modify C' into Cc 80 that it keeps count of the length of the proof it 


is constructing for t. Change the axiom 


k % proof 


into 
ft <A, O> proof g length. 


In each canon of C' which places a remark in the proof, replace the 
predicate proof with a predicate over ordered pairs, proof & length. 


' 


Thus, if C' has the canon 
@ proof ~ ... -- a;B proof, 
let C. have the canon 


<a, n> proof & length A... HK «v8 ,ntl> proof % length. 


Finally, let Sar have the canon 


<1;B, n> proof & length } <B, n> theorem % m t 


to associate with each t the length of its proof. Then the canonic 
system Cc. computes m(C, t). QED 


An Alternative Function 


Given that t is a theorem of C, the function m(C, t) gives us a 
measure of the difficulty of proving t in C. Unfortunately m is a 
function of each possible theorem. It would be desirable to sacrifice 


some of the exactness of m in favor of a more macroscopic quantity. 


4 
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Definition 3. Let m'(C, n) = sup {m(c, t)| t is provable in 
C and {t| = n}. . 


The function m' looks at all proofs of theorems of length n in C 
and takes the number of steps in the longest proof as its value. Thus, 
given a theorem t of length n, we can, under certain conditions, say 
that it may be proven in no more than m'(c, n) steps. By definition, 
m'(C, n) is the upper bound on the length of proof of a theorem of 
length n. We now consider the conditions under which it is meaningful 


to speak of m'. 


Theorem 2. m'(c, n) is computable if and only if C generates a 


recursive set. 


Proof. Consider the system C to be over a terminal alphabet T. 


Generate the set 
* 
s.7 {we T | w = n}, 


the set of all possible formulas of length n over T. For each wy ¢ s 
we can determine whether or not it is a theorem in C since C generates 


a recursive set. That is, generate 
s = fy eS | w is provable in C}.. 

Then, since m#(C, w) is computable when w is known to be provable in C, 
m'(C, n) = sup {m(C, w)|w e¢ s'il. 


If m'(C, n) is computable, then if any formula a is provable at 
all, it is provable in no more than m'(C, ja]) steps. Since C contains 
a finite number of axioms and canons, it is possible to generate all 
proofs of length 1 (the axioms) , length 2 (one canon applied to each 
axiom), and so on. If the theorem q@ has not been proven after m'(C, {Q| ) 
steps, it will never be. Since the membership question is thus decidable, 


C must generate a recursive set. QED 


There is a property of m' which is significant for its application 
to complexity measures for programming languages. 


Theorem 3. m'(C,n) is not necessarily monotonic increasing in n. 
Proof. Consider the counterexample 

+ aad 

xA - xaad 

F aB 

xB } xaaB 

XB F xA 


which generates the proofs 


1. aB, aA 

2. aad 

3. aB, aaaB, aaaA 
4. aaA, aaaad. 


Then m'(C,1) = 2, m'(C,2) = 1, m'(C,3) = 3, m'(C,4) = 2, and so on. 
QED | 


This property is entirely expected, for some short theorems may 


be much more difficult to prove than long ones. 


A More Precise Function 


The functions m and m' evolve naturally from a consideration of 


the process of proof in a canonic system. They do provide an indication 


of the difficulty of deriving a given theorem, but they make the 

' simplifying assumption that one canon is no different from another 
since each extends the proof one step. But certain rules may contain 
several premises, each of which must be examined before the conclusion 
may be drawn. Let us define another function which takes this into 


account, 


Definition 4. The proof measure function n(C, t) of a theorem t 
in a canonic system C is the total number of premises evaluated in the 


shortest proof of t. That is, if r »xr, is the shortest proof of 


qt k 
t in C, and if the canons applied during this proof were 
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Pin A Ta Avr °A “mk FT. 


k 
then n(C,t) = ra m 
j=l J 


Theorem 4. n(C, t) is computable iff t is provable in C. 


Proof. The proof is almost identical to that of Theorem 1. Build 
the canonic system C so that, at each step in the proof of t, a count 


is kept of the number of premises which have been evaluated so far. QED 


Definition 5. Let n'(C, 4) = sup{n(C, t)| t is provable in 
Cand jt|= g}. 


Theorem 5. n'(C, g) is computable iff C generates a recursive set. 
Proof. Similar to that of Theorem 2. QED - 


Theorem 6. If p is the number of axioms appearing in the shortest 
proof of t in C, then n(C> ¢) 2 m(C, t) - p. 


Proof. If ‘ axioms appear in the proof, then m(C, t) = p + x, where 
x steps are taken in ithe proof after the axioms are written down. The 
number of premise remarks evaluated for each of the axioms is zero; for 
the x remaining steps, y 2x premises must be examined since each canon 
has at least one premise. Thus n(C, t) = O+ y. These three conditions 
imply n(C, t) = m(C, t) ¥ p- QED 


Theorem 7. If C is a canonic system possessing q axioms, then 


n'(C, g) = m'(C, 2) - q. for all 4, 


Proof. At worst, some theorem of length ¢ may require the use of 
all q axioms in its proof; in any case, it requires at least one. The 


conclusion follows from this observation and Theorem 6. . QED 


Bounds on Measure Functions 


Consider a canonic system C generating sets of strings. We would 
like to relate at least one of the proof measure functions to the 
structure of the system C in such a way that we could avoid having the 


function described by only another canonic system. Let 
_ D(k) = [x |x is provable in exactly k steps in C}. 


Then, by definition, D(1) is the set of all the axioms of C, D(2) is the 
set of all theorems provable by applying exactly one canon to the strings 


in-D(1), and so on, Furthermore, let 


L(k) = sup {x| x ¢ D(k)}, 
S(k) = inf {x| x ¢ D(k)}. 


That is, L(k) is the length of the longest theorem provable in k steps, 
and S(k) is the length of the shortest. 


Assume that the canonic system C has the property that no canon 
ever "shortens" a theorem. That is, if t is provable in k steps and t' 
ink +1, then |t'|, 2 [t|. Under these conditions, we can say that 
L(k + 1) 2 L(k) and S(k + 1) 2 S(k) for all k. 


Recall that L(k) is the length of the longest string possible after 
a derivation of k steps. Then the inverse function uh (2) is the mini- 
mum number of proof steps required to generate any theorem of length z; 
all proofs of fewer steps yield strings shorter than z. Now the proof 
measure function m'(C, z) is, by definition, the maximum number of proof 


steps required to generate all theorems of length z. We therefore obtain 
Lemma 1, u ¢) sm'(C, z) if C is non-erasing. 


A similar argument may be advanced for the S(k) function. Since 
s7h¢z) is the maximum number of proof steps required to generate all 
theorems of length z, all longer proofs yield strings of length greater 


than z. This observation produces 


Lemma 2. m'(C, z) 8 s7*(z) if C is non-erasing. 
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and 


Theorem 8. If C is a canonic system with the property that t 
provable in k steps and t' in k + 1 implies |t'|2= |t|, then 
u (2) <m'(C, z) < sh ¢z) for all z. 


Computation of L(k) and S(k) 


Consider a system C generating strings over an alphabet V in 
which each canon is either an axiom or a rule of the restricted form 


xF - W XW5G 


where Wis Wy ¢€ Vv. Let 4, be the length of the longest axiom in C 

(4, = gup {1%} |r is an axiom of C}) and g be the maximum number of 
symbols added to a derived string by the application of a single canon 
(2 = sup {11% | xF W XW 5G is a canon of C}). Then we observe that 


LQ) = 4, 

L(2) = L(1) + £ 

L(k) = L{k-1) + 4 
so that 

L(k) = 45 + (k-1) 4. 


Similarly, if §, = inf {1%] {Fe is an axiom of C} and 
s = inf [|¥,%)| | xR b+ wxW,G is a canon of C} describe the shortest 
starting string and the least number of symbols added by any canon, then 


S(k) = 85 + (k-1)s. 
The inverses are easily found to be 
-1 
Lo@ez- 444 
Le 


el 
S “(z) se een 
8 
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Noting that canons of the form used never "shorten" a theorem, we may 


combine these results with Theorem 8 to yield 


Theorem 9. If C is a canonic system in which all canons are either 


of the form}wW or xF + w,xW,G, then 
ee m'(C, z) <7 ~ 30 + 8 where 4,» t 8, 8 are as 
L ) 


defined previously. 


The class of canonic systems yielding exactly the regular sets 
requires canons of the form either } aG or xF +} xaG, where a is a single 
symbol from V. Thus ies L= s, =s = 1, and Theorem 2 produces 


z<m'(C, z) <2. We have proven what may be obvious: 


Corollary 1. If C defines a regular set, then m'(C, z) = z. 


Now consider a system C generating strings over an alphabet V in 


which each canon is either an axiom or a rule of the restricted form 


oe x eee 
XE) Av AXLE f Waxy OVX Waits ns 
1 2 n.. 
R 
where ws eV, x, 6 {Xpreee ox fs and n> 1. 
Define 
1. Tax = SUP {n | FE) A X5E5 AeeeA xen + B(x) +6 +X GE 
is a canon of C} 
2. a inf {n| xiF, A XoF 5 AveeA x En - B(X,5++- 9% DG. 
is a canon of C} 
3. 4, = Sup {|} | Fw is an axiom of C} 
4. £= sup [Jww, ... | XiF) AeA XE, F 
WX, WX, ++. Wx, WE is a canon of C} 
1 20 n 
5. 8s = inf {|u| [bw is an axiom of C} 


6. S =i. We SNe fae 
ene elas 1 vn | *,E) Bese a e 
WwW xX, Wx .-. W xX, WG is a canon of C andn=r }. 
oi, 1 i, n-1 i,m min 


The definitions of Ayr fs 85> and g are similar to those given 
in the case where each canon is allowed to have no more than one premise. 


min’ the maximum and the 


minimum number of variables appearing in any canon. Again we desire 


to find L(k) and S(k), the lengths of the longest and the shortest 


In addition, we desire to know r and r 
max 


string derivable after a proof of k steps. 


By definition, L(1) = 4) the length of the longest axiom. Now 
the longest string derivable by the application of a single canon would 


be produced when the longest strings possible were assigned to 


> Peay in the canon 
1 r 
max 
Ey A pkg Metre Be Mo Oty ere! Wee Ne 
max “max 1 2 r max 
max 
and [ww ree WD | = g. Such a canon may or may not exist in C, but 
max 


no canon in C produces strings growing at a faster rate. Thus we observe 


that 
L(1) = 4, 
L(2) = Tax) ee 
L(k) = T axe (k-1) + 4 
so that Bai k-2 ; 
L(k) = Ie ax? = £2 nag) 
k-1 
me pte ykrl ‘ aie gas 
4, max r -1 
max 


‘Solving this equation for k to produce the inverse function, we obtain 
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2a(r =e LI +h 
max ) where r ae 
LZ max 
o 


ule) = 1+ log, ( 
max 


We now desire to Find an expression for S(k), the length of the 
shortest string derivable after k applications of canons of the form 


N wee AXF - wx G. 


x,F WaX, eee W x, W 
1 wn ° i, 1 1, nel i n 


1 


In this canon, we must substitute strings for n variables. If the 

length of the string produced as a conclusion is to be as short as 
possible, each variable should be assigned the shortest possible string. 
However, some variable must refer to a string produced in the previous 
proof step, or there would be no reason to have that step. This would 
contradict our assumption that all proofs are of minimum length. We thus 
produce the shortest permissible result when one string x, is assigned the 
length S(k-1) and n-1 have the shortest possible length, 86° Once again, 
there may or may not be e canon which allows this, but all other sub- 


stitutions will produce longer conclusions. 


In addition, we desire to use the canon with the least number of 
variables in its conclusion. This canon, by definition, contains constant 
strings of total length s. For sufficiently large string lengths, the 
effect. of concatenating several long strings is greater than just adding 
a constant number of symbols each time. Thus we expect this analysis of 
S(k) to be valid for only large k. 


The expression we have justified for S(k) is 
S(k) = S(ke1) + 84h in -“l)+s. 
The solution of this equation is 


S(k) = (ke1) (sory +s) - (ke2)s) i 


' whose inverse is 


“ z+s (xr, -2z) +8 
S we) eo min 


9 ta 1) +s 


Combining these results, we obtain 


Theorem 10. If C is a canonic system in which all canons are either 


ELA +++ AXE fF wx 


of the form }+w or x5) e oe coe W GX we G, then 


n=l i 
n 


z(r_ wl) +h z+s(r_, <2) +s 
1+ log. ( max ) < m'(C, z) < of min ) 
max L s (r_.-l) +s 
fo) in 
for sufficiently large 2, r > 1, and s,s, £_, and £ as defined 
max o ° 


previously. 


The class of canonic systems yielding exactly the context-free 
languages requires canons of the form either + aG or xA A yB + xyG, where 
a is a single symbol from an alphabet V [4]. Then Pel 4, =l,s=he=0O, 


andr, =r = 2, and Theorem 10 reduces to 
min max 


Corollary 2. If C defines a context-free language in standard form, 
then 1 + log, zsm'(C, z) Sz. 


Summary 


We have defined four functions measuring the complexity of a 
derivation in a canonic system. These functions were shown to have 
certain properties, and relationships among them were derived. For 
canons in which only the identity function appears in the premises, we 
derived bounds on some of these functions in terms of the structure of 


the structure of the system. 


There is a question of interest which we have not settled: Given 
a system C, is it possible to find a system C* such that C* is equivalent 
to C, but m'(C*¥, 2) < m'(C, 4) for all £? In other words, is it possible 
to "speed up" a canonic system in the same way that a Turing machine can 
be speeded up? The answer to this question will determine the nature of 


a lower bound on our measure functions. 
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Chapter 3 - Inverse Systems and Compiling 


Introduction 


A canonic system C is a specification of the members of a set and, 
as such, gives rules for generating successive elements of that set. 
Because of its similarity to the parsing problem, the inverse process is 
‘more interesting: given a string w, how could it have been generated by 


C? The operation is one of analysis rather than synthesis. 


An algorithm which accepts a canonic-system description of a language 
and analyzes an input string based on this description exists [1]. We go 
one step further and create from C a new canonic system co which exactly 


describes the recognition procedure for strings generated by C. 


For example, consider the language 2 = fa'ba” | n 2 0} defined by the 
system 
t bA 
xA faxad . 


A proof in this system C that aabaaA is 
bA, abaA, aabaad . 
Consider the system 


axadé cA 
bAEI 


where the conclusion I indicates an arbitrary terminating condition. When 
this system is presented with an axiom Fw such that w is provable in C, it 
proceeds to decompose that string. For example, if we include the axiom 


} aabaadA, the system becomes 


+ aabaad 
axaA }+ xA 
bA FI 


and generates the proof 


aabaaA, abaA, bA, I . 


Apart from the end-of-proof indicator I, this is simply the proof in C 


written backwards. The system we have just presented is Ge. the inverse 


of C. 
Construction of cnt From C 


The relationship between the proofs in et and C suggests a procedure 


for obtaining the canons of cnt from those of C: interchange the premise 


and the conclusion. Thus if C has the canon 
xA | axad , 

~1 

Cc should have 
axaA + xA . 


This procedure fails if the original canon had multiple premises, for the 


inversion of the general canon 

£ (xp eeee RED AweeA Espero Eb B(X)1+++sX DG 
would produce 

B(x, 5-4 x DG a £ (xp ree KDE, AeesA Fahy Ey : 


and we do not define the meaning of a canon with multiple conclusions. 
(There is no way of indicating in a proof that several conclusions must 
hold simultaneously.) Thus only systems C in which each canon has no 
more than one premise possess an inverse cnt by this construction, It is 
the case, however, that any arbitrary system can be reduced to this form, 


and this we now demonstrate. 


Normal-Form Systems 


Given an arbitrary system C, it is possible to derive a system Cy such 


that 


1. Each canon in Cy has a single premise. 
2. A remark t is provable in Cy iff t is provable in C. 


The general canon we must consider is 
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£ (xy 50+. DE, AeeeA Esp ox DE, } B(x, s+++sx DG 


where 
d 
n i 
fj: A 3A 
since the canon is over the algebraic system (A, Wren eh) > We shall 
call dq, the degree of £3 the result of applying £5 is a d,-tuple of 
elements from A, and F. must be a predicate over d,-tuples. For con- 


venience, let us abbreviate this canon as simply 


£,F, Aves 2 em F gG . 
The following algorithm produces Cy from C: 


1. If C has either the axiom |} wG or a canon fF} gG with a 
single premise, then retain these rules in Cy: 
2. If C has the canon 
fF AeeeA frre 3 


then let Cy have the canon 


fipseeesf > Fie F b aG 


™m 
and the new predicate F_...F_ of degree (>_ d,). 
1 m i=l i 
3. If Cy has the canons 


<Tjseeesr > RE uk’ 


, : | 
“* then let C, also have the canon 


<Ep seco eT 58) 900058 > BS - cu, v> R'S! 


P q 


and the new predicates RS of degree (degree R + degree S) and 
R'S' of degree (degree R' + degree S'). Repeat step 3 for 
each pair R'S' describing a predicate created by rules 2 and 
3. Since C has only a finite number of canons, this’ process 


will terminate. 
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4. No other canons are in C.. 


This algorithm reduces several premises to a function of higher 
degree. Rule 1 retains all those canons which already have only a single 
premise or none at all. Rule 2 reduces a canon with several premises to 


the desired form by combining the dq, results of each function f. (1 5 is m) 


m 
into one (>. d,)-tuple. When an n-tuple is evaluated, all components must 


i=1 
be shown true, so the effect of requiring the proof of all the original 


premise remarks is achieved. When canons are combined in this way, new 
predicates are created in the premises, and Rule 3 provides the canons to 


describe these. This rule provides for the interaction between objects 


described by several canons. 


For example, let C be the system 


k ad 

F bB 

bcc 

XA} axA 

xB + bxB 

xf - cx 

xA A yB + xyD 
*xC A yD k yeyE 


which generates & = L, 


mm km™ | njm,k 2 1}. By Rule 1 of the algorithm, 
Cy contains the canons 


b aA 
+ bB 

ec 

XA + axA 
xB } bxB 
xC |-cxC . 


By Rule 2, Cy contains 


<x, y> AB + xyD 
<, y> CD - yxy E 


corresponding to the two canons with multiple premises. 


definitions for the sets AB and CD; applying it to the canons 


in C 
n 


which define A and B separately produces the set of rules- 


<x, yoAB + 
xA + 
yB F 

Fk 


wax, by>AB 
<ax, b>AB 
<a, by>AB 
<a, b>AB 


Rule 3 provides 


The definition of CD generates the canons 


<x, y, z>ABC + 
<<; y>AB fF 


A new predicate, ABC, has 
applying Rule 3 again: 


<cz, xy>CD 
<x; xy>cD 


been generated, and it must be defined by 


<x, y, z>ABC + <ax, by, cz>ABC 
<y, z>BC + <a, by, cz>ABC 
<, yoAB + <ax, by, c>ABC 
<, zpAC + <ax, b, cz>ABC 

xA + <ax, b, c>ABC 

yB + <a, by, c>ABC 

zl + <a, b, c>ABC 


The predicates BC and AC must be defined, again by Rule 3: 


<=, yoBC 
xB 
yé 


F 
Fk 
- 


<xb, yc>BC 
<xb, c>BC 
<b, ye>Be 


<b, c>BC 


oO oe oe 


No new predicates were created in this step, so the generation of Cy is 


complete. 


<xa, yo>AC 
<xa, c>AC 

<a, ye>AC 

<a, crac 


Consider the proof of aabbbcaabbbE in the original system C: 
bB, aA, bbB, aaA, bbbB, cC, aabbbD, aabbbcaabbbE . 
The system Cc, pee this derivation as: 
bB, <a, bb> AB, <aa, bbb> AB, <c, aabbb> CD, aabbbcaabbbE . 
We now show that C, generates the same sets as C. 


Theorem 1. A remark t is provable in Cy iff t is provable in C. 
Furthermore, the proof in Cy is” effectively obtainable from that in C, 


and conversely. 


Proof. To show that t provable in C implies t provable in Cy» we 
will use induction on the length of the proof in C. 


Basis: If t is provable in one step in C, then t, being an axiom, 
is also provable in one step in Cy: Similarly, if t is provable in two 
steps. in C, it is the result of applying a single-premise canon to an 
axiom in C. By Rule 1 forming Sy from C, these operations may sine be 
performed in Cy 


Induction: Assume all theorems provable in less than or equal to 
k steps in C are also provable in Cy: A proof of length k+l of a theorem 


t in C is a list of remarks Kyreeead such that 


k+l 


1. vy is an axiom. 


2. ty (1 <is<ktl) is either an axiom or a remark 
immediately derivable from some subset of {Fyo+eeoFy_y)- 
Mtl = t. 

t is not an axiom of C since it is assumed to be provable in no less 


than k+l steps. Therefore r. is immediately derived from some subset 


kt1 
{W,F +--+, BE} of {r,5++-s¥,} by the application of a canon 


£,(e1s+++ KEY A ve A fpr oF k B(X,5---»%,)G 


where there exist Zqreees% & A such that f p(Zpore 9%) = W, and 


| B(2,5---s2,)G = Teas Furthermore, each w,F, is provable in less than 


i=t 
or equal to k steps. 
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From these facts about the proof in C, it is possible to con- 


struct the proof in C When C derives r by applying the canon 


N° kt+1 


£1 (x) >++-* DE) A wee A EG XE, F B(x, 5--- XE : 
C,, uses 
yo-+-.f > Fook. + B(X,5+++.*,)G 


generated by Rule 2 forming Cy from C. Each component of the m-tuple 
is provable in less than or equal to k steps in C; by the inductive 
assumption, each is therefore also provable in Cy as is the entire 
m-tuple. Then, as long as the predicate Fue F appears in the con- 
clusion of some canon, Tet may also be proven in Cy. But Rule 3 of 
the construction algorithm insures that all predicates appearing in 
premises appear on the right-hand side of some canon, so Tey 18 
indeed provable in Cy: 


Thus the existence of a proof of arbitrary length of t in C 
implies the existence of a proof of t in Cy 


It is now necessary to show that t provable in C, implies t 
provable in C. The proof of t in CG, is a list of remarks Byreee 8 
such that 


£ 


Lee: 8, is an axion. 


2. 8; (1 <i < 4) is immediately derivable from Si4° 


The following procedure reconstructs the proof of t in C. Con- 


sider, with no losa of generality, each remark 8; to be of the form 


a Q)++:9 where p 2 1. 


In the construction of Cy from C, multiple premises in canons are 
reduced to components of one n-tuple. If we know the predicates in C, 
we can look at such a remark in the proof in Cy and decompose the one 
term into several. That is, the remark 
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: <dy>- ed ye 


in C, becomes the set of remarks 


N 
QQ) +> 49, 


in C, where each Q, may be a predicate of arbitrary degree. Continuing 
this procedure on each remark in the proof Byars By of t in Cy, produces 


the proof of t in C. Therefore t provable in C, implies t provable in 


Cc. QED 


N 


Measure Functions of Normal Systems 


Theorem 2. n(C,, t » and 


n'(C., 1) = m'(C,, £2) for all g. 


) = m(C,; t) for all t provable in Cy 


Proof. These are trivially true since each step in a derivation 


in Cy requires the evaluation of exactly one premise. QED 


Theorem 3. If y is the maximum number of premises per canon in C, 
m(Cys t) 


then m(Cy; t) smc, t) <¥ — for all t provable in C. 
ve 


Proof. Figures 1 and 2 illustrate the correspondence between 
derivations in Cy and C. We observe that each node in the Cy tree 
expands into at least one and at most y nodes in the C tree. Both 
trees have the same number of levels (the number of nodes in the Cy 
tree is equal to the length of the longest path in the C tree), so 


that if the top level in C,, expands into v nodes, the next lower level 


N 
could contain as many as Vey = Vv, and so on. Thus 


m(C, t) 2 m(C, t) 


m(C, t) s v 


so that 


m(C., t) < m(C, t) < 


“etl 
es - Oem 
eyo 31 $485 


Figure 1 - Derivation in System C 


” § 


7 SiS5es08 


re oe 8922253 


Figure 2 - Derivation in System Cy 
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Inverses of Arbitrary Systems 


Recall that normal-form systems were introduced to show that any 
canonic system C can be reduced to another system Cy in which each canon 
has only one premise. The difficulty in constructing the inverse system 
co} from an arbitrary C arose exactly in the consideration of canons with 
multiple premises. The normal-form theorem resolves this problem by 
always providing an equivalent system of the required form. We may now 


formally define an inverse system. 


co} is constructed from a system C in normal form by the following 


algorithm: 


1. If C has the axiom | wG, then let cl have the canon 
wG -} I, where I is an arbitrary end-of-proof indication. 


2. If C has the canon E(x, o0++5% JE '. (XK, s+++ x DG, then let 
ct have the canon B(x) 1-+-9% GE - £(xX) s+++5x DF. 


Let Cy {a} denote the system C with the added canon a. The inverse 
system has the following property. 


Theorem 4. t is provable in C iff I is provable in ct u {Ft}. 
Furthermore, the proof in cv! is effectively obtainable from that in C, 


and conversely. 


Proof. If t is provable in C, then there exists a proof Tyrceesky 


of t in Cy such that 


1. ry is an axiom. 
2. ry (1 < is k) is immediately derivable from r 
i Pn 


i-1” 


ae 


If the axiom +t is given to et, the only proof possible must begin t,... 
since ct has no other axioms. In the proof of t is Cy k 


derived from r by applying a canon 


k-1 
£(x,s+++% )F - (XK) o-+9X dG . 


where there exist Zpreeesz € A such that £(z1 5000s 2 DE = They and 
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B(Z1 500052 )G = Th: By the construction of cv} from Cy? the inverse canon 


B(X15-+--x DG be £(X) 2040 XE 


is available in c, so the proof in cv! can continue: t, a - This 
process may be repeated until the axiom ry is reached: t, Teepreery 
Then, since the canon rb I exists in the inverse system if Fry was an 


axiom in Cy the proof terminates tyr) pocees ty » proving I. 


If I is provable in chy {tt}, then there exists a proof 
CB reer 8) at in that system. The last canon applied must have been 
81 F I, since these are the only canons which generate I. By the con- 
struction of cnt from Cy Sy must have been an axiom of Cy. At each 


step in the proof where s is deduced from 8, by the application of 


i+] 
a canon 


B(X)s+00sX )G - £(X) +++ JE r 
the inverse rule 
E(x, +++ oX EF bk B(X)5+++ XG 


exists in Cy and may be applied. After k-1 steps, we arrive at a proof 
Beyer 8y in Cy: A final application of the inverse of the canon which 


produced t from s, produces the desired proof. QED 


1 


Measure Functions of Inverse Systems 


The proof of Theorem 4 can be extended to indicate the relationship 


between the length of the proof in ov! and that in Cys 


eo 


Theorem 5. ate U{Ft}, I) = m(C,, t) + 1 for all t provable in Cy. 


Proof. The result follows immediately from a consideration of the 
construction of Theorem 4. The proof in the inverse system is simply the 


reverse, remark by remark, of the proof in Cy» plus the end-of-proof signal 


I. QED 


Corollary 1. m'¢c7t U{Ft}, |z]) = m'(C., |t|) + 1 for all t provable 
in Cy: , 


Proof. Immediate. 


An interpretation of these results is beneficial at this point. The 
corollary states that if t is known to be provable in GC, in no more than 
k = m' (Cy, |t]) steps, then the inverse system ct, given the single axiom 


kt, can reproduce the proof in no more than k+l steps. 


Theorem 3 of this chapter states that m'(C. £) < m'(C, &£) for all &. 
This fact and Corollary 1 produce 


Corollary 2. m'(c"+ U fF te}, {I]) < m'(c, | t|) + 1 for all t provable 
in C. 


Proof. Immediate. 


Inverse Systems As Recognizers 


Suppose the canonic system C describes a programming language. The 
system cnt behaves like an analyzer for that language since, given any 
string derivable in C, ct can produce a parse of that string. The set 
of all strings derivable in C is simply the set of all legal programs in 
the language described by C. Once the m' function is known for C, it is 
possible to characterize the recognition process of a-legal program in 


that language by applying Corollary 2. 


While m'(C, £) + 1 is an upper bound on the number of steps in the 
parse in ct, nothing we have said helps construct this parse. cv!) Like 
any canonic system, is a non-deterministic statement of transformations 
which may be applied at any step in a proof. In general, more than one 
canon may be applicable, and the correct rule to choose may not be im- 


mediately obvious. 


Recursive Sets And Programming Languages 


* Canonie systems in their most general form define recursively 
enumerable sets. Given a particular string w and system C, it is not 
possible..to.determine in general whether w could eventually be generated 
by C. A recognizer based on a recursively enumerable language may never 


halt for some input strings, and this situation is undesirable for 
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practical computing systems. 


One solution is to allow only language definitions which define 
recursive sets. It is undecidable, however, whether an arbitrary canonic 
system has that property (Appendix I). Another approach is to restrict 
the form of the system so that its language is known to be recursive. 

The correspondence with Chomsky's grammars does this, but the forms pro- 


duced do not resemble the definitions of programming languages. 


Let us examine the question from a different point of view: we are 
trying to insure that a compiler presented with an invalid program will 
not loop endlessly. In theory, a machine capable of examining any 
arbitrary string and deciding acceptance or rejection must be based on 
a recursive language. In practice, compilers are not usually asked to 
process completely arbitrary inputs. A source program may be correct 


except for the duplication of a statement label or a missing declaration. 


It is possible then to write the language description in such a way 
that common errors are detected. In effect, we are defining a new language, 
similar to the desired one except for well-defined errors. We still cannot 
guarantee our compiler will halt for arbitrary inputs, but "almost cor- 


rect" programs. will not cause infinite looping. 


For example, consider generating DO loops in Fortran of the simplified 


form 


DO <label> <integer variable> = <constant>, <constant> 


<label> CONTINUE 


Most Fortran systems impose the restriction that the value of the DO 
index may not be changed within the loop; that is, 


pO 5 J = 1, 10 
J = 20 
5 CONTINUE 


is illegal. Assume. we have defined in a canonic system the predicate 


eee 
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not _in such that <x, y> not in is true iff the element x is not contained 
in the set y, and <x, y> in if it is. A specification for a legal DO 
loop is: 


<x, y> set of Fortran statements with assigned variables 
A z integer variable A <z, y> not in A w statement label 


A i integer constant A j integer constant 


Dwz=i, j 
}- x legal DO loop . 
w CONTINUE 


To detect the case of a program redefining the value of the index, we 


might write: 


<x, y> set of Fortran statements with assigned variables 
A z integer variable A <z, y> in A w statement label 


A i integer constant A j integer constant 


Dwz= i, j 
- x program attempting to redefine DO index . 
w CONTINUE 


Thus a program with a syntactic error is recognized as such, and 
we may be assured that the compiler will not loop on receiving an incor- 
rect DO loop. The language defined by the entire canonic system may not 
be recursive in the formal sense, yet we may be able to determine member-~ 
ship in some set for most input strings we are likely to encounter. The 
language description might be called "almost recursive," and a recognizer 


based on that description would halt for "almost all" inputs. 


Summary 


This chapter has presented two main results. First, we showed how 
an arbitrary canonic system C could be reduced to an equivalent system Gy 
which contained only single-premise canons. From this normal form, we 
developed the idea of an inverse system ct which characterized the 
recognition of strings generated by C. In addition, we computed the 


measure functions of both Cy and co} and showed the relationship between 


them. Finally, we developed a technique for developing ‘almost recursive" 
language descriptions so that recognizers would not be 1:.kely to loop on 


most inputs. 


Inverse systems as we have defined them do not specify parsing 
algorithmns for the languages they recognize. An open quastion is whether 


inverse systems can be so extended, and under what condit:ons. 
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Appendix I - Systems Describing Recursive Sets 


Introduction 


A canonic system can specify the elements of any recursively 
enumerable set. However, from at least a theoretical point of view, it 
is desirable to know under which conditions the set generated is also 
recursive. Doyle [4] demonstrated equivalences between canonic systems 
generating sets of strings and the formal languages of Chomsky [ 2 ]. 
Since type 1, 2, and 3 languages are recursive sets, the related 


canonic systems also generate recursive sets. 


In this section we consider canonic systems generating set of 
natural numbers. It is difficult to describe functions of strings, 
but any operation on natural numbers can be characterized by some 


partial recursive function. First, recall three definitions. 


A set Q is recursively enumerable if and only if it is the range 
of some total recursive function. Equivalently, there exists some 


Turing machine which will generate all members of Q in some order. 
The characteristic function Cy) of a set Q is defined to be 


1 if xeQ 
Co(x) = 
QO if x4Q. 


A set Q is recursive if and only if its characteristic function is 
total recursive. In other words, there exists a Turing machine which, 
given any x, will decide either xeQ or x4Q. A set is recursive if and 


only if both it and its complement are recursively enumerable. 


Canonic Systems and Recursive Sets 


Let N be the set of natural numbers, and consider canons over the 
algebraic system (N, +). We now prove two theorems relating canonic 


systems and recursive sets of natural numbers. 
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Theorem 1. If a set QGN is recursive, then it can be generated 


by a canonic system in which the only canon describing Q is of the form 
fOOR b xQ 
where f(x) is a total recursive function. 
Proof. Since Q is recursive, its characteristic function 


1 if xeQ 
O if x4Q 


is total recursive. Let f(x) = Cox), and include the axiom /1P. Then 


the conclusion xQ in the canon 


Co@)P F xQ 


is provable if and only if xeN and Cg) = 1, since only then is the 
second premise CoP satisfied. Therefore xQ is provable if and only 


if xeQ, as desired. The entire cangnic system generating f is then 


- ip 
Co(HE f xQ 


QED 


The converse of this theorem is true, but a stronger result is 


possible. 


Theorem 2. Let the set & be defined only by canons of the form 
£1 (Xy9++ XE) Aver A fp XE, |. B(X,.-+-5*,)G. 


Then @ is recursive if 


-1 . 
g exists and is a total function 


1. 
2. £,,...,f£ are total 
1 m 
3. Freee oF are recursive sets 
4. B(x, >-+--5%)) depends on each variable x, in a non-trivial 
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Proof. Assume, with no loss of generality, that the range of g 


is the set of k-tuples z = Zjr-+-52,>. To show that € is recursive, 


we show how to determine whether ze@ or 2G for an arbitrary z. 


-1 
If g exists and is total, for an arbitrary k-tuple z = Byers rym 
we can compute ee) = Kypreee rk > Every variable that appears 
in the premises is some xy in eG) = Kyo eK > since g is assumed to 


depend upon all the variables non-trivially. Then, for each function 
Eyoeres 
knowing the values of Ei (Xpo- 1%); we can determine membership in 


ae 


tw we can compute f (x),-++5%,) since each f, is total, Then, 


OF since these are recursive sets. 


Formally, the characteristic function of & is computable for all 


Co(z) = Cp (Ey Oryy est, sn Oy (Eye o++,)) 


where eG) = Ayre X > and Coy yee Cey are the characteristic 
1 m 


functions of the recursive sets F rook. Since Coz) is total 
recursive, @ is a recursive set. QED 
| 


The General Decision Problem 


Theorems 1 and 2 relate certain canonic systems and recursive 
sets. This is not to say that recursive sets can only be described by 
systems of those forms, nor that any system with canons of another form 
can not generate a recursive set. We wish to sketch two results of 
Mandl [5] which explain the apparent inadequacy of our structure 


theorems. 


Theorem 3. It is undecidable whether an arbitrary canonic system 


generates a recursive set. 


Proof. A canonic system defines, in general, a recursively 
enumerable set, and it is undecidable whether an arbitrary recursively 


enumerable set is also recursive. QED 


_ Theorem 4. There is no class of canonic systems such that 


1. Every system in the class generates a recursive set. 
2. Every recursive set is generated by some member of the 


class. 


Proof. The proof requires a diagonalization argument which we 


do not wish to reproduce here. 


Summary 


For canonic systems generating sets of natural numbers, we have 
derived restricted forms of canons which guarantee that the system will 
generate a recursive set. The general question of deciding whether an 
arbitrary system generates a recursive set has already been shown un- 
decidable. . 
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Appendix II - Equivalent Systems 


Introduction 


In previous papers on canonic systems, only canons describing functions 
of strings have been considered. That is, the algebraic system has been . 
assumed to be the monoid (V*,+) of all finite strings over an alphabet V 
under the operation of concatenation. Under these conditions, the 
functions involved in the definition of a canon take on a particularly 


simple form. 


We desire the most general function f(x 2X) from n-tuples of 


be: 
strings into k-tuples. Since concatenation of variables and constants is 


the only operation available, that function is 


£(K,,--.5K%_) = W122, W111. W goerater pO Oe nee ee W > 
1 n 01°11 11 n,1i n,l Ok Ik 1k nk nk 


where zs46 {Xposeeax | and ws 5e v*, For example, a function over 


fa, b, c}* from ordered pairs into ordered pairs might be 
f(x, y) = <abcexy, axbycx>. 


Other authors have defined restricted forms of this general function, 


such as one in which each variable x eek is used exactly once. We 


1? 
desire to investigate some of these forms and show the relationship 
between them. The effect of this is to allow us to rewrite a system in 
an equivalent form which may be easier to manipulate in a particular 


instance, 


Predicates of Degree k to Indicated Context 


Definition 1. A canon with indicated context is a canon in which 


at least one function £ iy o+ 5%) in the premises contains a concatenation 


of a constant string and a variable. That is, 


EAS j acing) Mg Bi ig te a ae oO ont tte a ee 
1 1 k k 
whe aes Ke i 
re 255e {X, > x), wi 5e vV*, and at least one ae is not the empty 


string. 


This designation arises because the canon may be applied only if 
the context is matched. For example, the canon xaA A yB | <x, yo 


requires that a string ending in "a" be in A, 


Theorem 1. Any predicate of degree k > 1 can be reduced to a 


predicate of degree | and indicated context. 


Proof. A remark involving a predicate of degree k > 1 is of the 


* * 
form <s .,8,>P, where the range of each s;: (Vv sea +V is a single 


ee k 
string. Choose a symbol X not in V, and create a predicate Q of degree 
1 which contains the information in P in the form $,Xs,---Xs,Q. Wherever 


the quantity <s »S,>P appears, replace it by $)Xs5.--Xs,Q, which 


y7897°+ 9S, 
is of degree 1 and uses indicated context. QED 


Corollary 1. Any canonic system can be reduced to one in which no 


predicate is of degree greater than one, if indicated context is allowed. 


Proof. Apply the theorem to all predicates of degree greater than 


one, 


The converse of Theorem 1 is true, but the resulting predicate of 
degree k may still require indicated context. (A trivial proof just 


changes every predicate of degree one into degree two and makes the 
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information in each coordinate indentical.) The question to ask is whether 


the indicated context can be replaced by a simpler operation. This does 
not appear to be possible, as an example will show. Consider the canon 
xayA | xayB, which has the effect of placing in B all those strings in 
A which contain an "a'' anywhere. If we attempt to remove the indicated 
context and write something like <x, a, y>A, we have lost the ability to 
search for the "a'' within the string xay. As a matter of fact, it 
becomes necessary to allow terms of the form xy, where the division of 

a string into its x- and its y-component becomes completely arbitrary. 


For example, to simulate a premise remark of xayA A ..., we must write 


xA + <x, A, ADB. 
<ux, y, Z>B f <v, xy, z>B 


ww, xy, 2>B - mw, X, y2>B 
<x, 4, yOB A... 


where B is a temporary predicate introduced only to allow the searching 
from left to right. Since we feel such a canonic system is no simpler 
than. one allowing indicated context, we shall not attempt to state a 


converse to the theorem. 


Relation to Post Systems 


At this point, we wish to demonstrate the equivalence between 
canonic systems and Post systems. Although canonic systems 
evolved from Post systems, an exposition of the precess has never been 


made. 


A Post system is a set of rewriting rules of the form 
HQ ++ He BLY BOYD PY Pty 


where a, and By are constant strings over a finite alphabet V, and 


yye {X,.+++5%)}- For example, the rule 
xlly — x22yy 


“would change the string 333114 to 3332244, if x = 333 and y = 4. The 
left-hand part of a rule is called the antecedent, and the right the 


consequent. 


The distinguishing feature between Post and canonic systems is that 
a Post system allows only one workspace. All its rules refer to the 
same string, while a canonic system allows many such workspaces - one 
corresponding to each predicate. For example, the Post system generating 


all strings of a's of even length is: 


axiom’: aa 


rule : x -—» aax 


The analagous canonic system becomes 
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} aad 
A aaxA 


No more computing power can be added to a Post system. It is simply 
often more convenient to specify a process using the notation of canonic 


systems. 


Theorem 2. Any Post system oan be simulated by a canonic system 
using predicates of degree one. 


Proof. If the Post system has the axiom, let the canonic system 
have the rule + wW. If the Post system has the production 


0 Ry BLY Bo Yg ++ BeYmP tl’ 
let the canonic system have the rule 
By aa 8 + Oo E RA Le manne 


Then the set W describes exactly the strings the Post system generates. 


QED 


Observe that, in general, canons with indicated context will be 
required. The equivalence in the opposite direction is of interest 


only to show the construction required. 


Theorem 3, Any canonic system can be simulated by a Post system. 


Proof. We will only indicate the procedure involved in the 
simulation, since the details are tedious. We let the Post system 
build up a proof in much the same way that higher level systems do 
(Chapter 1). Markers must be left in the workspace to separate each © 
remark in the proof, and non-determinism must be allowed so that any 


canon can "reach" arbitrarily far back for remarks proven previously. 


Reduction of Indicated Context _to Cross Referencing 


Definition 2. A canon 


£, (kyr ee eX DE, A eon A fA Ckprer XE F G(X) 2+ sX GE 


where 


£ (x6. 05%) = Wes yoo St ee i Vag saves Mone Lk Fn ken 


and 254 € {Xyse0ee}, Ws 4 6 V* ig said to contain cross referencing if 
not all the 244 are distinct variables; that is, - = Z, does not imply 
that p = q for all p and q. 


Theorem 3. Any canon using indicated context may be reduced to a 


rule without indicated context if cross referencing is allowed. 
Proof. Consider a premise term of the general form 


oe Wd zw. G 


it gs a age re17*r"y 2 


where the w, are constant strings indicating contest, and the z, are 


variables. Form the rtl new canons 


I Hoy 


Fw A, 


bund, 


where the A, are new, distinct predicates. The intent of the original 


premise term can be simulated by 


x,A, A x,A r-1*r*r & + 


oAo 1 NAN vee A x AL A X,24%4 Zn 00 0X 


1 O11 2 


Since each A, contains exactly one element by construction, the effect 


> 
a. 
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of evaluating these premises is identical to that of applying the original. 


QED 


Summary 


We have shown that successive reductions from predicates of degree 
k to indicated context and from indicated context to cross referencing 
are possible. These manipulations indicate some of the relationships 


between canons of various types. 


3. 
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